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FOREWORD 



This report describes the development of the 1969-70 Michigan 
Educational Assessment measures which were used in assessing the 
levels and distribution of educational performance for Michigan’s 
districts, schools, and pupils. It was prepared jointly by Edu- 
cational Testing Service and the Michigan Department of Education, 
and is published by the Michigan Department of Education. 

The contents of this document are intended to fulfill the 
requirements of the Standards for Educational and Psychologi cal 
Tests and Manuals . Therefore, it is technical in nature and is 
intended for persons interested in the specifications and psycho- 
metric properties of the tests. 

Thanks are due to a large number of individuals and groups 
for making the Michigan Educational Assessment Program a reality: 
The State Board of Education for proposing it, the Governor and 
Legislature for actively supporting it, and Michigan educators 
for assisting with it. The program was designed and administered 
by Research, Evaluation, and Assessment Services, Michigan Depart- 
ment of Education, with the assistance of Educational Testing 
Service, and counsel of several ad hoc advisory groups. 

John W. Porter 

Superintendent of 
Public Instruction 



TECHNICAL REPORT OF SELECTED ASPECTS OF 
THE 1969-70 MICHIGAN EDUCATIONAL ASSESSMENT PROGRAM 

Introduction 

An assessment program is of value only when those who are to develop 
a course of action as a result of its implementation fully understand the 
construction and scoring of the instruments used in the program and the 
extent to which the battery has achieved its purposes. Consequently, 
this report attempts to answer three questions! How was the battery 
constructed? How were scores derived? How well did the battery succeed 
in achieving the intended goals? 

This report has four sections. The first section contains a brief 
description cf the 1969-70 assessment program, including a statement of 
purposes and a summary of procedures. 

The second section explains in detail the construction of both the 
achievement portion of the battery and the pupil questionnaire. Con- 
struction of the four achievement tests is described in terms of content, 
item-writing, test assembly methods, and review procedures. In addition, 
sources of questionnaire items and methods of selecting those questions 
to be included are described. 

Section three describes the development of the scaled scores arising 
from the statistical treatment of responses to the items in the various 
parts of the battery. Nine such scores were developed. Five are concerned 
with achievement (vocabulary, reading, English expression, mathematics, 
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and composite) ; four are concerned with student perceptions of educational 
correlates (socio-economic status, the importance of school achievement, 
self-perceptions, and attitude toward school). 

Section four deals with the characteristics of the battery components. 
Describing the characteristics cf the achievement tests and the question- 
naire, this section enables the reader to evaluate the aspects of the 
program that are dependent upon student responses. 

Description of the Assessment Program 

The plans and procedures of the 1969—70 Michigan Assessment Program 
were designed to determine the levels of educational performance in basic 
skills not only for the state as a whole but also for various subdivisions 
within the state. The program also called for the development and 
institution of means of. determining the status and distribution of several 
educational correlates within the state and within those same subdivisions. 
Implementation of the plans called for the development of achievement and 
attitude measures to be administered to scudents and for the creation of 
a data-file containing information regarding school district financial 
and human resources. The achievement and attitude measures were developed 
by Educational Testing Service in consultation with the State Department 
of Education. The information concerning school district resources was 
furnished by the State Department from existing records. 

The assessment battery was administered to fourth and seventh grade 
pupils in January of 1970. Table 1 shows the number of school districts, 
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schools, and students in the fourth and seventh grades that took part in 
the 1969-70 assessment program. 



TABLE 1 



Grade 


* 

School Districts 
N 


* 

Schools 

N 


Pupils 

N 


4 


585 


2,492 


158,713 


7 


566 


909 


159,407 



With enrollment of at least 5 students. 

Construction of the 1969—70 Assessment Battery 



Achievement Battery 

Before the process of constructing the 1969-70 assessment battery 
is described, regrettable fact must be brought to the reader’s attention. 
Because of the lateness of the funding authorization, the period of time 
allowed for test construction was altogether too brief. Although every 
effort was made to involve as many Michigan educators as possible in the 
process, the time restriction did not allow for as widespread participation 
of this group as would have been desirable. 

Several processes in the development of tests in the achievement 
areas covered by the Michigan Assessment Program were the same regardless 
of the subject under consideration. These processes are described below 
and are followed by a detailed description of the tests in each area: 
vocabulary, reading, English expression, and mathematics. 
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The first step in the construction of any test is the development 
of specifications. Test specifications have been likened to a skeleton. 
They are the bones or the framework that determine the shape and structure 
of the finished product. Test specifications set forth in detail the 
content, coverage, and emphasis of the test. For example, test specifica- 
tions for a reading test would include the type of material with which the 
student must work (words, sentences, paragraphs), the type of knowledge 
called for (synonyms, associations, factual, interpretation), and the 
number of questions involving each type of material and each type of 
knowledge to be included in the test. So that the achievement tests used 
in the Michigan Assessment Program might accurately reflect the objectives 
of education in the state, the professional staff of each of the several 
departments of the Test Development Division of Educational Testing Service 
wrote a set of preliminary test specifications based on texts in use in 
Michigan schools. 

Text-based specifications require a careful examination of the text- 
books that are in use and consultation with educators who are responsible 
for instruction. The purposes of such procedures are to bring into sharper 
focus the specific objectives in each of the subject-matter areas under 
consideration, to determine the relative emphasis given to each topic 
within any given curriculum, and to point up the extent to which any 
innovative programs (e.g., modern mathematics) are being adopted throughout 
the state. 

The preliminary test specifications so developed were reviewed by the 
ad hoc Assessment Battery Specifications Committee appointed by the 
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Mlchigan Department of Education. Members of this committee suggested 
several revisions for the improvement of the content and coverage of the 
proposed tests. Educational Testing Service revised the test specifications 
in accordance with these suggestions. Content specifications for the 
reading, English expression, and mathematics tests are presented in 
Appendix A. 

The second standard step in test development is the writing and 
assembly of items or questions of the type and in the quantity called for 
by the test specifications. The ETS procedure for writing items is as 
follows: An individual is assigned the task of producing a certain number 

of items along stringent lines carefully laid out in the specifications. 
After writing the items, he submits them to at least two other staff 
members for review. 

The review process for both the test questions and the tests them- 
selves involves more than a mere reading of the material. Some of the 
considerations borne in mind in both instances are: 

1. Does each item have one and only one correct answer? 

2. Is the language simple, direct, and free of ambiguity? 

3. Is each item independent of other items? (It: is not 
fair for the student to get two items right (or two 
wrong) because the same information is needed to 
answer both items.) 

4. An item should be difficult because it requires 
sophisticated reasoning, not because it tests obscure 
or esoteric subject matter. 
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5. Do the items cover a range of difficulty? Are they 
ordered from easy to hard? 

6. Does the test meet the content specifications? 

7. Is the content of any item "unfair" or "emotionally 
offensive" to any segment of the population? 

Continuing the question-writing phase of test development, the items are 
returned to the individual who wrote them. He makes whatever changes are 
necessary along the lines suggested by reviewers. 

A different staff member is assigned the task of assembling the test, 
which is the third step in test development. He receives and reviews each 
item, making changes, rejecting items considered unsuitable, and putting 
finishing touches on the items to be used. After the test is assembled, 
he submits the test for a Test Specialist review. Here the test as a 
whole is evaluated and each individual item again subjected to scrutiny. 



The test is then reviewed in turn by the department chairman, the 

coordinator of the program for which the test is being used, and the 

editorial staff. After each review, changes are made in line with the 

suggestions of the reviewer. Upon completion of the process within the 

* 



Test Development department, the test is submitted for review by a 
committee of those responsible for using the test; i.e., the Assessment 
Battery Specifications Committee in this case. The committee evaluates 
the test and makes suggestions for changes that are carried out by the 
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Test Content: Vocabulary 

The vocabulary section of the Michigan Assessment of Basic Skills 
Test for 1969-70 consists of 50 analogy questions. Each question begins 
with two words that go together in a certain way. Under them, there are 
four other pairs of words lettered A, B, C, and 0. The student is asked 
to find the lettered pair of words that go together in the same way as 

the first pair of words. For example: CALFtCOW:: 

(A) puppy: dog 

(B) nest third 

(C) horse :bull 

(D) shell: turtle 

The first pair of words go together in this way: A calf is a young cow. 

The only lettered pair of words that go together in this way is puppy:dog; 

that is, a puppy is a young dog. 

Items of this type use words that are recognizable by the population 
being tested. The student is asked, however, to be able to think in 
somewhat greater depth than simple recognition; he must be able to tell 
in what way a word is similar in meaning to, or different in meaning from, 

other vords. 

To apply learning gained in one situation to other similar situations, 
one must be able to see in what way similarities exist. One may learn 
that two apples and two apples make four apples, but to use this knowledge 
he must be able to see that two pears (or two of anything else) have 
certain similarities to two apples although apples are not pears. It is 
this kind of learning that most educators strive for rather than rote 
memorization of specific bits of information. 
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T&st Content : Reading 

The reading section of the Michigan Basic Skills Test contains ten 
word-association items, ten sentence items, and twenty questions based on 
reading passages. The word-association items indicate the students 
ability to read and understand the meaning of individual words. The 
sentence items measure both the ability to read and understand the meaning 
of a sentence and to derive the meaning of a word in the context of a 
sentence. The reading passages measure the students’ ability to read a 
short p3.ss3.go and understand and interpret facts# 

In the word-association test, the student is asked to read a word 
and then choose another word or phrase that goes best with it. The word 
that goes "best" with the first word is one of thi*e types. It may be a 
word that means almost the same thing as the first word (cheerful :happy) ; 
it may be a phrase that illustrates an emotion or action appropriate to 
the first word (cry :boo-hoo) ; or it may be one or more words that describe 
something commonly associated with the first word (asksMay I?). 

The sentence items are of two types. In oue type, inference, the 
c-udent is asked to pick a word which describes what has happened in the 
sentence. For example: "The car would not move because the wheels sank 

down into the mud." "The car was "Stuck" is the answer which 

the student must pick from three other options. 

A second type of sentence, comprehension, asks that a student under- 
stand a word contained in a sentence, the meaning of which is largely 
explained by the rest of the sentence. The Allowing is an example. 
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Sally was miserable because she had the measles and could not 
go to the party. 

Miserable means 



(A) happy 

(B) sad 

(C) calm 

(D) smiling 



The section with reading passages contains six short passages, each 
followed by questions. Approximately two-thirds of the questions have to 
do with understanding the factual contents of the passage; one-third asks 



for siomething that was either implied or could be inferred from the passage. 
Both types of questions are illustrated in the following example: 

Many bodies of water are called seas, but the term l, the 
Seven Seas" refers to the largest of all, the oceans: the 

Arctic and Antarctic, the North and South Atlantic, the North 
and South Pacific, and the Indian. 

The Indian Ocean takes its name from the country along 
its northern shore, India, whose name in turn came from 
"Sindhu," meaning river. India was called, in olden days, 

"the land beyond the river." 

In the Middle Ages, the Atlantic Ocean was called "the 
sea of darkness." Men knew its eastern shore, but beyond 
there lay darkness, the darkness which came from ignorance 
and fear. 



The Pacific, the largest ocean of all, was first seen , 
by a Spanish explorer who named it "Mar del Sud" — the 
Sea of the South. A later explorer named it "Pacific," 
meaning peaceful, because of the serenity of its surface. 

(1) Which of these is NOT one of the Seven Seas? 



(B) Indian 



(C) Baltic 





(A) Atlantic 



(D) Arctic 
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(2) Why is the Pacific Ocean called "Pacific? 

(A) It is in the North. 

(B) It divides the continents. 

(C) It is large and dark. 

(D) It looks serene. 

Test Content: English Expressio n 

There are four parte to the English Expression Test: Spelling, 

Effectiveness of Expression, Grammar and Usage, and Capitalization. As 

is indicated below, the number of questions varies with the grade level 

for which the test is intended. 

A. Spelling (Grade 4, 15 items; Grade 7, 20 items) 

The questions for this part present four words, one of 
which may be misspelled. Sources of spelling error 
include such possibilities as misunderstanding of th? 
rules for word formation, reversing of letters, and 
spelling by sound alone. 

Example: (A) monney 

(B) funny 

(C) sunny 

(D) no error 

B. Effectiveness of Expression (Grades 4 and 7, 10 items) 
Questions testing effectiveness of expression ask the 
student to demonstrate his sensitivity to language by 
selecting the wording which, in addition to being 
grammatically correct, is best in sentence structure 
and word order and is most precise in idiom and diction. 
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Example: (A) Her car is red like this here one. 

(B) Her car is red like this one. 

(C) This car is red like the one of hers. 

(D) Like her car, this here one is red. 

C. Grammar and Usage (Grade 4, 10 items; Grade 7, 13 items) 

The questions for this part assess the students' ability 
to work with such matters of appropriate sentence 
construction as subject-verb agreement, adjective forms 

a 

(including a and an), and, for grade seven only, agree- 
ment between a pronoun and its antecedent. 

Example: Before he ran for office, Mr. Lloyd had 

A 

spoke out against high taxes. No error . 

B C D 

D. Punctuation and Capitalization (Grade 4, 10 items; Grade 7, 

12 items) 

These questions assess the students' mastery of such common 
matters as the use of capital letters to begin proper names 
and to signify the first-person "I”, the use of end-of- 
sentence marks, and the use of the apostrophe to form 
possessives and contractions. 

Example: (A) My teacher's name is Mrs. Smith. 

(B) My teacher's name is Mrs Smith. 

(C) My teachers name is Mrs Smith. 

(D) My teachers name is mrs. Smith. 

The English Expression Test is not a direct measure of a student's 
ability to write. It is not, and it does not purport to be, a test of a 
student's ability to think imaginatively, to write "creatively," or to 
function in his own won. Id of communication. Rather, it is a valid and 
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reliable predictor of a student's mastery of certain aspects of the 
language arts curriculum and of his ability to produce the kind of 
language that is often called "standard written English," the language 

i 

of most of the books that students read for school. 

Test Content: Mathematics 

The mathematics section of the Michigan Basic Skills Test contains 
30 questions arranged in order of difficulty. Although most of the 
questions are of middle-difficulty, they range from very easy ones that 
require little more than recall or recognition to some challenging ones 
that test reasoning ability. The majority of the items are designed to 
test comprehension of concepts that have been taught up to the appropriate 
grade level. 

The test does not emphasize computation; however, care was exercised 
to include as many different number facts as possible in the course of 
solving problems. The problems are phrased in simple and direct language 
that would not present a reading problem to most students. Only the more 
generally recognizable symbols of mathematical notation are used. 

For each question, the three incorrect options were determined by 
considering the common misconceptions students have, the most prevalent 
computation errors they make, or the answers they would get if they chose 
the wrong method of solving the problem. The four options are arranged 
in order from least to greatest value or vice-versa, except in those cases 
where determining the order is precisely what is being tested. There 
follows two example items of the type in the mathematics section. 
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Example: A - 14 = 61 

What number does represent? 

(A) 47 

(B) 75 

(C) 79 

(D) 84 

Example: If N is a whole number greater than 10, which of the 

following number Is least? 

(A) N - 1 

(B) N - 4 

(C) N - 9 

(D) N + 3 

Construction of Pupil Questionnaire 

Plans for the 1969-70 Michigan Assessment Program called for a 
preliminary investigation of the status of several educational correlates, 
their distribution, and their possible relationships to basic skills 
achievement. Consequently, the literature on the correlates of school 
performance was reviewed. This review pointed up the fact that the 
questions posed in the study conducted by Coleman and others^ have proved 
to be the most useful in those situations where the information desired 
is to be gotten from students. The responses of the national sample of 
sixth-grade students to questions from this study were examined, and 
those that seemed appropriate and that differentiated well among various 
segments of the population were adopted for use in this program. To the 



^James S. Coleman and others, Equality of ducational Opportunity , 
Washington, D. C. , U. S. Government Printing Office, 1966 
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questions thus gleaned, several others, written by ETS professional staff 
members and approved by the Michigan Department of Education, were added. 

The resulting questionnaire, labeled "General Information," consists 
of 26 questions designed to provide measures of such educational correlates 
as family socio-economic status, attitude toward school, educational 
aspirations, and self-concepts. Examples of the questions included are: 

Does your family have a dictionary? 

Do you like school? 

Are you planning to go to college? 

Do most of your classmates like you? 

Development of Scaled Scores 



Basic Skill Scores 

The score that is obtained by counting the number of questions a 
student answered correctly is a raw score. Raw scores always refer to a 
particular test and can never be compared to raw scores on any other test. 
Identical raw scores obtained on different tests may represent different 
levels of performance, depending on the length of the tests and their 
relative difficulties. Therefore, the raw scores on each of the basic 
skills tests in the Michigan Assessment Battery were transformed to standard 
or scaled scores with an average of 50 and a standard deviation of 10. The 
equations for transforming the raw scores on each test at each grade level 
are tabled ii appendix B. 

The transformation of raw scores to standard scores maxes them 
comparable; that is, the same standard scores from different tests represent 
the same relative degree of achievement. Furthermore, standard scores from 
different tests may be added together or averaged. 
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After the standard score scale for each test had been established, 
a composite achievement score was computed for each student. This score 
is the average of an individual’s scaled scores on reading, English 
expression, and mathematics. Vocabulary scores were not included in the 
computation of composite achievement scores. 

Scores from the Questionnaire 

For purposes of developing scales from the pupil questionnaire, a 
spaced sample of approximately 7,000 seventh-grade students was drawn 
from those who participated in the Assessment Program. The statistical 
technique known as principal components factor analysis was used to create 
scales from the responses of these students to 25* of the 26 questions 
included in the General Information section of the Assessment Battery. 

This means that all of the responses to these questions were analyzed to 
determine which groups of items tended to measure the same pupil percep- 
tions. For example, questions such as "If you had your choice, would you 
rather go to a school other than this one?" and "Do you like school?" 
were indicated as measuring the same student perception: attitude toward 

school. These were only two of a number of items which contributed to 
this attitude scale. The statistical analysis then weighted each of the 
items in terms of its contribution to each scale. The end result of these 
analyses was the construction of four scales : a measure of socio-economic 



*The responses to the first item, "Are you a girl or a boy?", were not 
included in the analysis. 
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status and three measures of student attitude (importance of school 
achievement, self-perception, and attitude toward school). 

Inasmuch as the Michigan Assessment Battery was designed to measure 
the achievement and perceptions of groups of students rather than 
individuals, the scores of all students within a school were averaged to 
provide an estimate of the levels of educational performance and related 
factors for each school that participated in the program. These school 
average scores were then combined for all schools within each of the 
several districts in the state to indicate the district-wide status of 
the variables under consideration. These combinations were of two types: 

percentile distributions and averages. 

The percentile distributions contained in Assessment Reports 4, 5, 
and 6, are based on school average scores combined in various groupings; 
e.gc, districts and SES standings. School averages based on less than 
five students were not used in the preparation of these percentile 
distributions . 

Similarly, the unweighted district averages or means upon which the 
percentile distributions in Assessment Reports 4 and 5 are based did not 
include school averages for those schools where less than five students 
responded to the questions in the Assessment Battery. 
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Analysis of Selected Facets of the Assessment Battery 
Achievement 

Tests may be evaluated by determining several of their character- 
istics. The test characteristics that are important in evaluating group 
measures, such as those in this battery, include: validity, reliability, 

difficulty, and speededness. Each of these terms will be defined before 
the characteristics of either the fourth or seventh grade battery are 
discussed. 

The validity of a test is an indication of the extent to which it 
does the job it is intended to do. The most important type of validity 
for achievement tests is content validity. Content validity means that 
a test which claims to measure mathematics, for example, should contain 
questions in mathematics and that those questions should be appropriate 
for the grade level for which the test is intended. The content validity 
of a test is dependent upon the extent to which the questions represent a 
good sample of the topics that are involved in the subject tested. 

Content validity is ensured by entrusting the development of the 
tests to specialists in test construction and in the subject to be 
covered. The development of the achievement tests in the Michigan 
Assessment Battery has been described earlier and was designed to produce 
tests of high content validity. The success of these endeavors may be 
judged by examining the test copy. 
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The reliability of a test is an estimation of its consistency. A 
test is reliable if it consistently does the job it is intended to do; 
that is, if all of the questions are on the same general topic and if a 
student would receive roughly the same score if he could take the test 
more than once under the same conditions. The reliability of a test is 
reported as a coefficient; that is, a two-place decimal figure. A widely 
accepted rule of thumb states that a test possesses an acceptable degree 
of reliability if the coefficient is .80 or higher. The Kuder-Richardson 
Formula 20 was used in estimating the reliability of the basic skills 
tests in the Michigan Assessment Battery. 

The difficulty of a test is an indication of how well it is suited 
for the group being tested. A test that is appropriate for a given group 
should be of middle difficulty, and the scores obtained by that group 
should extend throughout the possible range of scores. Middle difficulty 
of a test consisting of four-choice items implies that the students would 
know the correct answer to half of the items and would get another quarter 
of them correct by guessing. 

One way of estimating the difficulty of a test is to convert the raw 
score average to a percentage of the number of items in the test. In 
these terms, middle difficulty for a four-choice test would be 62.5%. A 
percentage significantly higher than 62.5% would indicate an easy test; 
a percentage significantly lower than 62.5 % would indicate a hard one. 




20 



-19- 



Speededness is a measure of the extent to which test performance is 
affected by the time limit placed on the test. The criterion used in 
judging the speededness of a test is two-fold: the proportion of students 
who answered (1) the last question and (2) the question that is three- 
quarters of the way through the test. This does not mean that the 
students answered all the questions up to these points; it means that 
they reached these particular items. If 80 percent of the students 
complete the last item and virtually all the students complete 75 percen* - 
of the items, the test is usually judged to be unspeeded. 

GRADE 4 

A spaced sample of 1,005 student records was drawn from the records 
of all the fourth grade students who participated in the program. The 
responses of this sample were used to analyze the characteristics of the 
fourth grade Basic Skills Battery. The results are given in Table 2. 

The tabled reliability coefficients for the fourth grade Basic 
Skills Battery range from .83 for mathematics to .90 for reading, and 
the estimated reliability for the composite achievement measure is .95. 
According to the previously stated rule of thumb, all of these tests are 
reliable; the reading and composite achievement measures are highly so. 

The indications of difficulty presented in Table 2 show that the 
vocabulary test is difficult for the fourth grade students. The remaining 
tests — reading, English expression, and mathematics — fall within the 
acceptable range of middle difficulty. None are easy. 
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The vocabulary test is speeded; only two-thirds of the students 
reached the last item and only four-fifths of them got three-quarters of 
the way through the test. The three remaining tests are clearly unspeeded. 

A detailed description of the results of the analysis of the tests in 
the fourth grade Basic Skills Battery will be found in Appendix Bl. 

TABLE 2 

Characteristics of the Michigan Assessment 
Basic Skills Battery, 1969-70 

Grade 4, N = 1005 



English 





Vocabulary 


Reading 


Expression 


Mathematics \ 


Number of items 


50 


40 


45 


■> 

30 1 

•4 

1 


Average* 


21.8 


24.1 


26.3 


16.3 | 


Standard deviation* 


8.5 


8.2 


8.1 


5.7 | 


Reliability 


.87 


.90 


.88 


.83 ! 


Difficulty 


43.5% 


60.3% 


58.5% 


55.3% j 

•] 


Speededness: 








i 


Proportion reach Lng 
last item 


66% 


88% 


93%** 


y 

88% j 


Proportion reaching 
3/4 point 


86% 


97% 


98%** 


1 

97% j 



* In raw score units 

**Average for separately timed parts 
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GRADE 7 

A spaced sample of the records of 1005 students of all the seventh 
grade students who participated in the program was drawn for a detailed 
analysis of the characteristics of the tests in the seventh grade Basic 
Skills Battery. Table 3 shows the results of these analyses. 

The reliability coefficients for the seventh grade tests range from 
.82 to .88. The estimated reliability of the composite achievement scores 
is .94. All of these measures show acceptable to high reliability. 

All of the tests in the seventh grade battery proved to be of 
slightly greater than middle difficulty, but all of them fall within the 
acceptable range. None are easy. 

There is no evidence of undue speededness in any of the tests. In 
each instance, more than 80% of the students finished the tests and more 
than 90% of them reached the three-quarter point. 

A detailed description of the results of the analysis of the tests 
in the .seventh grade Basic Skills Battery will be found in Appendix B2. 
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TABLE 3 

Characteristics of the Michigan Assessment 
Basic Skills Battery, 1969-70 

Grade 7, N =* 1005 



Vocabulary 



Number of items 50 

Average* 27.3 

Standard deviation* 8.1 

Reliability . 86 

Difficulty 54.6% 

Speededness: 

Proportion reaching 

last item 84% 

Proportion reaching 

3/4 point 97% 



* In raw score units 

**Average for separately timed parts 



Reading 


English 

Expression 


Mathematics 


40 


55 


30 


22.4 


31.5 


16.0 


6.3 


8.9 


5.9 


.82 


.88 


.84 


55.9% 


57.2% 


53.3% 


90% 


96%** 


90% 


98% 


99%** 


98% 
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APPENDIX A 



TEST SPECIFICATIONS 
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VERBAL ABILITY SPECIFICATIONS: MICHIGAN TESTING PROGRAM 1969-70 



Verbal Ability -- (Analogies) 

20 minutes 

50 items 4 choice questions 



Bases for analogies: synonyms, antonyms, part-whole, P^t-part .caose- 

effect, object-purpose or action, class-sub-class, place- n 
measurement, expression of, associated with, character of, related to. 



Subject matters 
tools, weather, 
geography. 



animals, household, human relationships, transports, 
aesthetics, sports, measurement, abstract, genera , 



Vocabulary level in accordance with medium difficulty level for re 
spective grades. 
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Reading Test Content Specifications 
(1969-70) 



Grade 4 Grade 7 



I. WORDS 



10 items 10 items 



A. Synonymous (BEGIN: start) 

B. Associative (DRY: desert) 

C. Illustrative (RELUCTANT: "I’d 

really rather not.") 



II. SENTENCES 



A. Inference (External — tests word 

not in sentence but inferred 
by sentence) 

B. Comprehension (Internal— tests 

word in sentence, underlined 
or boldface — straightforward 
comprehension) 



10 items 



5 



5 



10 items 



5 



5 



III. READING COMPREHENSION 
A. Items 



20 items 20 items 



1. Factual (explicit) 

2. Interpretive (inference) 



_ . 6 passages 6 passages 

B. Passage Content ° pd ° B 

* 2 

1. Narrative J 2 

2. Science ^ - 

3. Social Studies 1 

4. Humanities 
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English Expression Test Content Specifications 

(1963-70) 



Grade 4 



Grade 7 



SPELLING 



15 items 



20 items 



A. 


Misunderstanding of rules for 
word formation 


4 


9 


B. 


Misunderstanding of rules for 
word transformation 


2 


5 


C. 


Reversing of letters 


2 


1 


D. 


Common mispronunciation 


1 


1 


E. 


Spelling by sound alone 


4 


4 


F. 


Other 


2 





II. 



EFFECTIVENESS OF EXPRESSION 



10 items 



10 items 



Questions testing effectiveness of expression ask the student 
to demonstrate his sensitivity to language by selecting the wording 
which, in addition to being grammatically correct, is best in 
sentence structure and word order and is most precise and appropriate 
in idiom and diction. 



Ill, 



GRAMMAR AND USAGE 



10 items 



13 items 



A. Subject-verb agreement 2 

B. Verb forms 2 

C. Double negative 1 

D. Adjective/adverb cor fusion 1 

E. Adjective forms (including a-an) 1 

F. Pronoun forms 1 

G. Pronoun-antecedent agreement 



4 

1 

1 

1 

1 

2 

1 



No error 



IV. 



PUNCTUATION AND CAPITALIZATION 



10 sentences 12 sentences 



A. Capital letter 



1. First word of sentence 

2. Proper names 

3. First person "I" 

4. In titles 

5 . In quotations 



4 

4 

1 

1 



10 

1 
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English Expression Test Content Specifications (Continued) 

(1969-70) 



IV. PUNCTUATION AND CAPITALIZATION (Cont’d) 



Grade 4 



B. Period 

1 . 

2 . 



C. 

D. 



End of Sentence 
Abbreviation 



Question Mark 



Comma 

1 . 

2 . 

3. 

4. 

5. 

6 . 



Address, date 
Direct address 
Series 

In quotations 

Apposition 

With exclamation 



F. Apostrophe 



1. Possessive 

2 . Contraction 



1 

1 



2 

1 

1 

1 



Quotation Marks 

1. Direct address, direct quote 1 



2 

1 



H. Semicolon 



Grade 7 



1 

2 

1 



3 

1 

2 

1 

1 



8 



4 

2 

1 
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Mathematics Test Content Specifications 

(1969-70) 









Grade 4 


Grade 7 


I. 


Number and Operations 


15 items 


10 items 




A. 


Operations with integers 


4 

o 


1 




B. 


Place value 


2 


1 




C. 


Properties of integers, divisibility 


2 

o 


1 




D. 


Proper fractions 


2 

A 


1 

1 




E. 


Decimals and per cents 


0 


1 




F. 


Properties of operations (commutative, 




A 






associative, distributive, closure) 


2 


2 

n 




G • 


Estimation 


1 


0 




H. 


Special properties of zero and one 


2 

r\ 


2 

1 




I. 


Average 


0 


X 


II. 


Geometry and Measurement 


3 items 


7 items 




A. 


Units of measure: length, weight, time, 




o 






temperature, money 


1 


4 

1 




B. 


Perimteres and areas of simple polygons 


1 


1 




C. 


Scale drawings and maps 


0 


1 

A 




D., 


Properties of polygons and the circle 


1 


2 




E. 


Angles and intuitive ideas of geometry 


0 


1 


III. 


Relations, Functions, Graphs 


2 items 


4 items 




A. 


Use of mathematical formula 


1 


1 

A 




B. 


Reading and interpreting graphs 


1 


3 


IV. 


Logical Thinking 


1 item 


1 item 




A. 


Intuitive ideas: counterexample, 










reasoning 


1 


1 


V. 


Mathematical Sentences 


2 items 


4 items 




A. 


Equations 


1 


2 

9 




B. 


Inequalities 


1 


4 




Mathematics Test Content Specifications (Continued) 

(1969-70) 



Applications 

A. Word problems (other than those 
already listed in one of the 
categories above) 

Note: At least one-third of the problems 

could be classified as applications. 



Grade 4 



7 items 



7 



Grade 7 
4 items 

4 



TOTAL 



30 items 



30 items 



APPENDIX B-l 
GRADE 4 TEST ANALYSIS 
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The Michigan Educational Assessment Program became operational during 
the 1969-70 academic year. Appropriate levels of the test battery, "Michigan 
Assessment of Basic Skills," Form SMT, were administered to all Michigan 
pupils in Grades A and 7 during January, 1970, The present report concerns 
the Grade A pupils. An outline of the test battery follows: 

Vocabulary (20 minutes) 

Reading (30 minutes) 

English Expression 

A. Spelling (5 minutes) 

B. Effectiveness of Expression (7 minutes) 

C. Grammar and Usage (6 minutes) 

D. Punctuation and Capitalization (7 minutes) 

Mathematics (25 minutes) 

Five scores are obtained for each pupil, namely, one score ba^ed on 
each of the four tests and a composite achievement score, which is the sum 
of the Reading, English Expression, and Mathematics scaled scores. 



NOTES ON PRINCIPAL FINDINGS 



Total Group 

158,713 Grade A pupils. 

Sample 

Spaced sample of 1,005 cases represents total group. 

Appropriateness of Tests to Group 

Scores extend throughout possible range on each of the four tests. 

Each mean is lower than middle difficulty for the group. 

Score Conversion 

Scores on each test converted to scaled with a mean of 50 and a standard 
deviation of 10. Scaled scores used in all reporting. 

Reliability 

Estimates of .869, Vocabulary; .899, Reading; .879, English Expression; 
.826, Mathematics; .9A7, Composite Achievement. 
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